Question Bank: Unit 1-Relational Database Design 


Short Question and Answer 


1. 


Dr. 


What is a functional dependency in database management system? 


A functional dependency is a relationship between two attributes in a database table, where the value of 


one attribute uniquely determines the value of another attribute. 
What is E.F. Codd's Rule in database management system? 


E.F. Codd's Rule is a set of 12 principles that define what constitutes a truly relational database 


management system (RDBMS). 
What is the first normal form in database management system? 


The first normal form (1NF) requires that each attribute in a relation omtable contains only atomic values, 


meaning that it cannot be further decomposed into smaller values. 
What is the second normal form in database management system? 


The second normal form (2NF) requires that all non-keyattributes in a relation or table are fully dependent 


on the primary key. 
What is the third normal form in database management system? 


The third normal form (3NF) requires‘thatjall non-key attributes in a relation or table are non-transitively 


dependent on the primary key. 
How are functional dependencies used in database management system? 


Functional dependenciés are used to determine the appropriate level of normalization for a database, 


which helps to reduce*data redundancy and improve data integrity. 

What is a superkey in database management system? 

A superkey is a set of attributes in a relation or table that uniquely identifies each tuple or row in the table. 
How, do you identify the primary key in a relation? 


The primary key in a relation can be identified by finding the attribute or set of attributes that uniquely 


identifies each tuple or row in the table. It must also be non-null and unique. 


What is a candidate key in database management system? 
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A candidate key is a minimal set of attributes in a relation or table that can uniquely identify each tuple or 


row in the table. 
Why is normalization important in database management system? 


Normalization is important in database management system to reduce data redundancy, improve data 


consistency and integrity, and make the database more efficient and easier to maintain. 
What is the difference between a functional dependency and a partial dependency? 


A functional dependency exists when the value of one attribute uniquely determines the value’ of another 
attribute. A partial dependency exists when the value of a non-key attribute depends,on only a part of the 


primary key, rather than the entire key. 
What is a transitive dependency in database management system? 


A transitive dependency occurs when a non-key attribute depends on another non-key attribute, rather 


than on the primary key directly. 
What is normalization in database management system? 


Normalization is the process of organizing data in a.database to reduce redundancy and dependency. It 
involves breaking down a table into smaller tablestto eliminate repeating groups and to ensure that each 


table represents a single entity. 
What is redundancy in database design? 


Redundancy occurs when the same data is repeated in multiple tables or fields within a table, leading to 


data inconsistency and wastage of storage space. 
What is an insertion anomaly in database design? 


An insertion anomaly occurs when it is not possible to insert new data into a table without also adding data 


to another table. 
What isan updation anomaly in database design? 


Ar.updation anomaly occurs when updating data in a table leads to inconsistencies in other parts of the 


database. 
What is a deletion anomaly in database design? 


A deletion anomaly occurs when deleting data from a table inadvertently removes other data that should 


have been retained. 
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How can redundancy be eliminated in database design? 


Redundancy can be eliminated in database design through the process of normalization, which involves 


breaking up tables into smaller, more atomic tables and eliminating repeating groups. 
How can insertion anomalies be prevented in database design? 


Insertion anomalies can be prevented in database design by ensuring that each table has a primary k nd 


that all required fields are included in that key. Ae’ 


What is decomposition of relations in database management system? 4 
Decomposition of relations is the process of breaking down a larger relation nae er, more atomic 
relations to eliminate redundancy and improve data integrity. we 

What is a lossless join in database management system? © 


A lossless join occurs when decomposing a relation into smaller r Ps does not result in any loss of 


information or data. The original relation can be ow Y joining the smaller relations back 


together. <> 


What is the Dependency Preservation property in d management system? 


The Dependency Preservation property states QS) sy functional dependencies that exist in the original 
relation should also hold true for the a ad relations obtained after decomposition. In other words, 


decomposing a relation should not re the loss of any functional dependencies. 
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Long Question and Answer 


Discuss the Relational Database Management System. 
Definition: A Relational Database Management System (RDBMS) is a type of database management 
system that is based on the relational model, a mathematical concept for representing data.as a 
collection of tables with rows and columns. 

1. Data Storage: RDBMSs store data in tables and enforce relationships between tables through 


the use of keys. 


2. Data Retrieval: RDBMSs allow for efficient data retrieval through the use of SQL (Structured 


Query Language) to interact with the data stored in the tables: 


3. Data Integrity: RDBMSs help maintain data consistency and integrity through the use of keys 


and constraints, ensuring that the data stored inthe tables is accurate and consistent. 


4. Popular RDBMSs: Some of the most commonly used RDBMSs include Oracle, MySQL, 
Microsoft SQL Server, and PostgreSQL. 


5. Relational Model: The relational model and relational algebra form the foundation of RDBMSs, 


providing a formal way-to define, manipulate, and query data stored in tables. 


6. Tables and@iujples: RDBMSs use tables and tuples to represent data, where each table 


represents a relation and each tuple represents a single instance of the relation. 
7. Attributes: Each tuple has a fixed number of attributes, which are named and have a specific 


data type, and represent different pieces of information about the instance represented by 


the tuple. 
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8. SQL: The RDBMS uses SQL to interact with the data stored in the tables, allowing users to 


insert, update, and retrieve data in a way that is consistent with the relational model and 


relational algebra. 


2. Define Relation giving a graphical representation. 
A relation in the context of relational databases is a mathematical construct that represent Si 
of data. A relation consists of a set of tuples (rows) and a set of attributes (columns h tuple 


represents a single instance of the relation and has a fixed number of attributes, which are named 
and have a specific data type. Ss 


A relation can be thought of as a two-dimensional table, where th ws represent tuples and the 


columns represent attributes. Relations are used to model data Na ructured and organized way, 


making it easier to perform operations and queries on the d 
For example, a relation could be a table of emplo formation, where each tuple represents a 
single employee and each attribute represents(a pI ce of information about that employee, such as 


Id, name, post, and phone salary. gs important concept in relational databases, and they 


form the foundation of relational database management systems (RDBMSs). 


Relation: Employee rm 


Domain 


Columns or Fields or Attributes 5000, 100000 
ey MPI | | ENAME | ost | Salary 
Key | EMP_IL | POST Salary 


+ 
ee a oe 

2s 

Records £6 


Data Value 


Degree (No. of Columns) = 4 
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3. What are the main characteristics of relation? 


The main characteristics of a relation in a relational database management system (RDBMS) are: 


1. Tabular Structure: A relation has a tabular structure, with each tuple representing a row and 


each attribute representing a column. 


2. Unique Names: Each attribute of a relation has a unique name, which is used toddentify the 


attribute in the relation. 


3. Data Types: Each attribute in a relation has a specific data type, such as.integer, string, or date, 


which determines the kind of data that can be stored in the attribute. 


4. Values: Each attribute in a relation can have a specific set of values, depending on the data 


type of the attribute. 


5. Null Values: A relation may contain null values, which indicate that a value is missing or 


unknown. 


6. Order Independence: The-order of tuples and attributes in a relation is not significant and does 


not affect the meaning-of the relation. 
7. Degree: The number of attributes in a relation is called the degree of the relation. 
8. Cardinality: The number of tuples in a relation is called the cardinality of the relation. 


9. »Referential Integrity: RDBMSs use referential integrity constraints to enforce relationships 


between relations, ensuring that data stored in the tables is accurate and consistent. 


These characteristics define the structure and behavior of a relation in a relational database, and they 


play a critical role in the design and implementation of relational database management systems. 
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4. How RDBMS is built based on relation algebra? 
RDBMSs are built based on relational algebra, which is a mathematical language used to describe 
operations on relations (tables). The relational model and relational algebra form the foundation of 
RDBMSs, providing a formal way to define, manipulate, and query data stored in tables. 


In relational algebra, relations are treated as mathematical sets, and operations such a Bn 


intersection, difference, and selection can be performed on them. This allows for the ma tion of 
data in a way that is both efficient and mathematically sound. Ss 
5. What is the Use of Codd’s Rule in DBMS? ce 


Codd's 12 rules are used to determine whether a relational ew agement system is a true 


S 
Rs 


relational database management system or not. 


6. Describe the Codd's rules. <> 
Codd’s rule in DBMS also known as Codd's 12 r ommandments is a set of thirteen rules 
(numbered 0 to 12). If a database follows C 2 rules, it is called a True relational database 


management system. * 
These rules were originally set out in Dri Edgar F. Codd and were developed further by him in 
1985. 


G 


Codd's 12 rules 


5 


Foundation Rule 
Information Rule 
Guaranteed Access 
[ 3 | Systematic treatment of null values 
Active online Catlouge 


[5 | Powerful and well structured language 


View Updation Rule 


7 Relational Level Operation 
Physical Data Independence 
Logical Data Independence 
Integrity Independence 
Distribution Independence 


Non-Subdivison Rule 


iw) 
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Rule 0: The Foundation Rule 

For a system to be qualified as a relational database management system, it must be able to 
manage databases entirely through its relational capabilities. 

Rule 1: The Information Rule 

The information in a relational database must be stored in columns or rows of a table, i.e., a 
cell. 

Rule 2: The Guaranteed Access Rule 

Each and every datum in a relational database must be logically accessible using the 
combination of the table name, primary key value, and column name. . 

A datum is an atomic value, i.e., a piece of information that cannotbebroken down further. 
Rule 3: Systematic Treatment Of NULL Values 

NULL values are fully supported in a relational database andsrepresent missing information or 
inapplicable information in a systematic way, independent of the data type. NULL values are 
different from empty strings, blank spaces, and 00. 

Rule 4: Active Online Catalog 

Structure of database must be stored in.anonline catalog which can be queried by authorized 
users. 

Database Description (Catalog) of.a complete database must be stored online. The rules of the 
rest of the database mustvalso apply to the Catalog. The query language used to query the 
database should be used for the catalog also. 

Rule 5: The Comprehensive Data Sublanguage Rule 

A database_should be accessible by a language supported for definition, manipulation and 
transaction management operation. 

Rule: The View Updating Rule 

Theoretically, updatable views are also practically updatable by the database system. 

Rule 7: High-Level Insert, Update & Delete Rule 

The database system must follow high-level relational operations such as insertion, updation, 
and deletion at each level or row by row. It also supports the union, intersection, and 
subtraction operations in database systems. 


Rule 8: Physical Data Independence 


Omprakash Chandrakar _ | Essentials of RDBMS Page |8 


Question Bank: Unit 1-Relational Database Design 


The working of a database system should be independent of the physical storage of its data. 
If a file is modified (renamed or moved to another location), it should not interfere with the 
working of the system. 

Rule 9: Logical Data Independence 

If there is a change in the logical structure (table structure) of the database, the user view of 
the data must not change. 

Say a table is partitioned into two tables, the new view should give the result as the join of the 
two tables. A 

Rule 10: Integrity Independence an 

Integrity constraints specific to a particular relational database nh: be defined in the 
relational data sub-language and stored in the catalog and not.in RE application. 

Rule 11: Distribution Independence . Ss } 

A database should work properly regardless of its disttibution across a network. The end-user 
should not be able to see that the data is distributed over many locations, they should always 
get the impression that the data is located ata OF site only. 

Rule 12: The Non-subversion Rule ss) 

If a relational system allows low-level actece, that low cannot be used to subvert or bypass the 


integrity rules to modify the “data. This can be achieved with some sort of looking or 


encryption. 
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You should also know about: 
Relation in the context of set theory 
Relation refers to a relationship between the elements of 2 sets A and B. It is represented by R. We 


say that R is a relation from A to A, then R © AxA. A relation from set A to set B is a subset of AxB. 


A relation is a mathematical construct that defines a connection between elements of two sets. A 
relation is typically represented as a set of ordered pairs, where each ordered pair contains an 
element from the first set and an element from the second set. The ordered pair indicates that the 


elements are related in some way. 


For example, consider the sets A = {1, 5, 8} and B = {a, b, c}. Arelation R between A and B could be 
defined as R = {(1, c), (5, a), (8, b)}, indicating that the elements 1, 5, and 8 in set A are related to 


the elements c, a, and b in set B. 


Relation 
{(1,c) , (5,a) , 8,b) } 


In the context of relational databases, a relation is often used to model a table of data, where the 
rows of the table are treated as elements of a set and the columns are treated as attributes of the 
elements. The ordered pairs in the relation then correspond to the rows in the table, and the 


elements in the ordered pairs correspond to the values in the columns. 


Relations are a useful way to represent and organize data, allowing for efficient data retrieval and 


manipulation. 


7. What is functional dependency? Explain giving example. 
Functional dependency refers to a relationship between two attributes (or columns) in a table, where 


the value of one attribute uniquely determines the value of another attribute. 
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For example, consider the following table Employees. 


Employee ID Name Department Code | Manager Name 
1001 Alice Smith DOO1 John Johnson 
1002 Bob Johnson DOO2 Sarah Lee 
1003 Carol Chen DOO1 John Johnson 
1004 Dave Davis D003 George Wong 
1005 Emily Kim D002 Sarah Lee 


In this example, we can see that there are two functional dependencies: 
e Department Code --> Manager Name 
The department code uniquely determines the manager name; 
We can say that Manager Name is functionally dependent on Department Code. 
e Employee ID --> Name (since the employee ID uniquely, determines the employee name) 
e Employee ID --> Department Code 


e Employee ID --> Manager Name 


8. What do you mean by determinant and@ependent in functional dependency? 
A determinant is an attribute (or set of attributes) that uniquely determines the value of another 
attribute in the same table. 
A dependent is an attribute-(or set of attributes) whose values depend on the value of the 


determinant. In other words, the dependent attribute is functionally dependent on the determinant. 


StudentID | StudentName Class 
101 Alice 10th 
102 Bob 11th 
103 Charlie 10th 
104 Dave 9th 
105 Eve 12th 
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In this table, we can see that the "StudentID" attribute uniquely determines the values of the 
"StudentName" and "Class" attributes. 


To represent this functional dependency, we can write it as follows: 


StudentID > StudentName, Class 


Determinant Dependents 
"StudentID” determines the values of the "StudentName" and "Class". 
In other words: 

"StudentName" is functionally depends on "StudentID”. 


"Class" is functionally depends on "StudentID”. 


9. Identify the functional dependencies exist in the following table. 


Employee Table 


Employee_Id | Employee_Name | Employee_Department | Salary 
1 Ryan Mechanical $5000 
2 Justin Biotechnology $5000 
3 Andrew Computer Science $8000 
4 Felix . Human Resource 510000 


10. What are the different types of functional dependencies? Explain giving examples. 
There are four types of Functional Dependencies exist in RDBMS. 
1, Trivial functional dependency 
2. Non-Trivial functional dependency 
3. Multivalued functional dependency 


4. Transitive functional dependency 


1. Trivial Functional Dependency in DBMS 
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In Trivial functional dependency, a dependent is always a subset of the determinant. In other 
words, a functional dependency is called trivial if the attributes on the right side are the subset of 
the attributes on the left side of the functional dependency. 

X > Y is called a trivial functional dependency if Y is the subset of X. 


For example, consider the Employee table below. 


Employee_Id Name Age 
1 Zayn 24 
2 Phobe 34 
3 Hikki 26 
4 David 29 


Here, { Employee_Id, Name } > { Name } is a Trivial fufctional dependency, since the 
dependent Name is the subset of determinant { Employee_td, Name }. 


{ Employee_Id } > { Employee_lId }, { Name } > { Name } and { Age } > { Age } are also Trivial. 


2. Non-Trivial Functional Dependency in DBMS 
It is the opposite of Trivial functional, dependency. In Non-Trivial functional dependency, 
dependent if not a subset of the determinant. 


For example, consider the Employee table below. 


Employee_Id Name Age 
1 Zayn 24 
2 Phobe 34 
3 Hikki 26 
4 David 29 


Here, { Employee_Id } > { Name } is a_ non-trivial functional dependency 
because Name(dependent) is not a subset of Employee_Id(determinant). 


Similarly, { Employee_Id, Name } > { Age } is also a non-trivial functional dependency. 
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3. Multivalued Functional Dependency in DBMS 


In Multivalued functional dependency, one (or more) attributes determine multiple values for 


another attribute. 


For example, consider the City table below. 


City PIN Code 
Bangalore 560001 
Bangalore 560002 
Bangalore 560003 
Chennai 600001 
Chennai 600002 
Mumbai 400001 
Mumbai 400002 
Mumbai 400003 
New Delhi 110001 


In this table, the City attribute determines the PinCode attribute. However, there can be multiple 


pin codes for a single city. 


Here, { City } > { PIN Code } is a Multivalued functional dependency, since we get multiple values 


for PIN Code for a given city: 


4. Transitive Functional Dependency in DBMS 


Consider two funétional dependencies A > B and B > C then according to the transitivity axiom A 


> C must also.exist. This is called a transitive functional dependency. 


For example, consider the Employee table below. 


Employee_Id Name | Department Street Number 
1 Zayn CD 11 
2 Phobe AB 24 
3 Hikki CD 11 
4 David PQ 71 
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Employee_Id Name | Department Street Number 


5 Phobe LM 21 


Here, { Employee_Id > Department } and { Department > Street Number } holds true. Hence, 
according to the axiom of transitivity, { Employee_Id > Street Number } is also a valid functional 


dependency. 


11. Identify the functional dependencies exist in the following tables. 


Table 1: Orders 


OrderlID | CustomerID OrderDate OrderTotal 


1001 C001 2022-01-01 500.00 
1002 C002 2022-01-02 750.00 
1003 C003 2022-01-03 1000.00 


Table 2: Employees 


EmployeelD | EmployeeName | Department | Salary 


E001 Alice Sales 50000.00 
E002 Bob Marketing | 60000.00 
E003 Charlie HR 55000.00 


Table 3: Students 


StudentID StudentName Class | Section 
101 Alice 10th A 
102 Bob 11th B 
103 Charlie 10th A 
104 Dave 9th C 
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StudentID StudentName Class Section 


105 Eve 12th B 


12. Give an example of multivalued functional dependency. 


Car 

Car_model Maf_year Color 
H001 2017 Metallic 
HOO1 2017 Green 
HOO05 2018 Metallic 
HO05 2018 Blue 
H010 2015 Metallic 
H033 2012 Gray 


In this example, maf_year and color are independent of each other but dependent on car_model. In 


this example, these two columns are said to be multivalued dependent on car_model. 


This dependence can be represented like this: 


car_model ->> maf_year 


car_model->> colour 


13. Give‘an example of all types of functional dependencies taking a table. 


Consider the given Employee table. 


Employee_Id Name __|Department/ Street Number 


1 Zayn CD 11 


2 Phobe AB 24 
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Employee_Id Name __|Department/ Street Number 
3 Hikki CD 11 
4 David PQ 71 
5 Phobe LM 21 


1. Trivial functional dependency: wt 
Employee_ld + Name > Employee_lId sv 


Employee_Id + Name > Name & 


2. Non-Trivial functional dependency: 


Employee_Id > Name 
> 


Employee_Id > Department 


3. Multivalued functional dependency: S 
Department >> Employee_Id Cy 
QS 


Employee_Id > > StreetNumbe 


<° 
4. Transitive functional dep Y" 


Employee_Id > rtment and Department > StreetNumber 


> Employee ta > StreetNumber 


14. What is normalize 


Normalization(s the process of organizing data in a database in a way that reduces data redundancy 
and ens data integrity. It involves dividing larger tables into smaller ones and establishing 


relati ips between them. 


15. Which processes are involved in normalization? 
The process of normalization involves 
1. breaking down larger tables into smaller ones 


2. identifying relationships between tables 
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3. establishing foreign keys and referential integrity constraints to ensure that the data in the 


tables is consistent and accurate. 


16. For what purpose normalization is used? 
Normalization is used to eliminate database anomalies. Database anomalies are inconsistencies and 
errors that can occur when data is stored in a denormalized table. These anomalies can occur When a 
table contains redundant data or when there are dependencies between non-key attributes, 
17. What is database anomalies? 
Anomaly means the inconsistency occurred in the relational table during the operations performed 
on the relational table. 
There can be various reasons for anomalies to occur in the database. 
1. There is a lot of redundant data present in our database:then DBMS anomalies can occur. 
If a table is constructed in a very poor manner thenthere is a chance of database anomaly. 
Due to database anomalies, the integrity of the’database suffers. 


2. The other reason for the database anomaliesiis that all the data is stored in a single table. 


So, to remove the anomalies of the database, normalization is the process which is done where the 


splitting of the table and joining of the table (different types of join) occurs. 


18. What are the different types oftariomalies present in database? Explain them taking the example 


of following table. 


Worker_id Worker_name Worker_dept Worker_address 
65 Ramesh ECTOO1 Jaipur 

65 Ramesh ECTOO2 Jaipur 

73 Amit ECTOO2 Delhi 

76 Vikas ECT501 Pune 

76 Vikas ECT502 Pune 

79 Rajesh ECT669 Mumbai 
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There can be three types of an anomaly in the database: 


1. Updation Anomaly 


When we update some rows in the table, and if it leads to the inconsistency of the table then 
this anomaly occurs. 

In the above table, if we want to update the address of Ramesh then we will have,to update 
all the rows where Ramesh is present. If during the update we miss any single row, then there 


will be two addresses of Ramesh, which will lead to inconsistent and wrong databases. 


Insertion Anomaly 


If there is a new row inserted in the table and it creates the inconsistency in the table then it 
is called the insertion anomaly. 
For example, if in the above table, we create anew row of a worker, and if it is not allocated 


to any department then we cannot insertiit in the table so, it will create an insertion anomaly. 


Deletion Anomaly 


If we delete some rows from the table and if any other information or data which is required 
is also deleted from the database, this is called the deletion anomaly in the database. 

For example, in the above table, if we want to delete the department number ECT669 then 
the details of Rajesh will also be deleted since Rajesh's details are dependent on the row of 


ECT669. So, there will be deletion anomalies in the table. 


To remove this type of anomalies, we will normalize the table or split the table or join the tables. 


19. What are the different types of anomalies present in the following table? 


stu_id stu_name stu_address stu_club 
220 Annamalai Kerala yoga 
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stu_id stu_name stu_address stu_club 

220 Muthu Kerala Music 

231 Mukesh Mumbai Crypto 

232 Muni Karnataka Public Speaking 
232 Muni Karnataka Arts 


There can be three types of an anomaly in the database: 


1. Updation Anomaly 


For student Muthu, we have two columns in the above table as he belongs to two clubs at the 
college. If we want to change Muthu's address, we must update it twice otherwise the data 


will be inconsistent. 


When the correct address gets updated in one club but not in another, Muthu would possess 
two different addresses, which is not accéptable and could result in inconsistent data. 


2. Insertion Anomaly 


In the above table if a new student named Nanda has joined the college and he has no 
department affiliation asthe club allows intake of students only from second year. Then we 


can't insert the data of Nanda into the table since the st_club field cannot accept null values. 


3. Deletion Anomaly 


Suppose, for instance, the college at some point closes the club crypto, then deleting the rows 
that contain s_club as crypto would also delete the information of student Mukesh since he 


belongs only to this department. 


20. When insertion, updation and deletion anomalies may occur? 


A database anomaly is a flaw in the database that occurs because of poor planning and redundancy. 
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1. Insertion anomalies: This occurs when we are not able to insert data into a database because some 
attributes may be missing at the time of insertion. 

2. Updation anomalies: This occurs when the same data items are repeated with the same values and are 
not linked to each other. 

3. Deletion anomalies: This occurs when deleting one part of the data deletes the other necessary 


information from the database. 


What is Database Normalization? 

Normalization is a process in database design that involves organizing the data in ayway that reduces 
redundancy and dependency among tables. The goal of normalization is to eliminate data anomalies, 
such as insert, update, and delete anomalies, and ensure that data is consistent, accurate, and easy 


to maintain. 
How do you perform Database Normalization? Explain INFp2NF, and 3NF in brief. 


Normalization is typically performed by breaking down large tables into smaller, more manageable 
tables, each containing a single theme or concept. This process is usually performed in a series of 
steps, referred to as normal forms. The,smost commonly used normal forms are first normal form 


(1NF), second normal form (2NF), and third normal form (3NF). 


1. First normal form (1NF): In@NF, each table column must contain only atomic values, that is, indivisible 


values. For example, atable column should not contain comma-separated lists of values. 


2. Second normal form (2NF): In 2NF, each table must have a primary key, and every non-key column must 
be functionally dependent on the entire primary key. This means that if a table has composite primary key 
(ies, aprimary key that consists of multiple columns), each non-key column must depend on the entire 


composite key, not just on one part of it. 


3. Third normal form (3NF): In 3NF, each non-key column must be functionally dependent only on the primary 


key, not on any other non-key column. This means that a table must not have transitive dependencies, 
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where one non-key column depends on another non-key column, which in turn depends on the primary 


key. 


23. Explain the normalization process giving example. 
First Normal Form (1NF) 
A relation is in 1NF if every attribute is an atomic attribute. (i.e., every attribute is a single-valued 
attribute.) 


EmployeeDetail 


Employee Code | Employee Name | Employee Phone Number 
101 John 98765623,998234123 
101 John 89023467 
102 Ryan 76213908 
103 Stephanie 98132452 


Here, the Employee Phone Number is not an atomic value bUtit contains multiple values. So, this the 


violation of 1 NF. Therefore the given relation is not in. NF: 


To convert this table into INF, we make new rows with each Employee Phone Number as a new row 
as shown below: 


EmployeeDetail 


Employee Code | Employee Name.) Employee Phone Number 
101 John 998234123 
101 John 98765623 
101 John 89023467 
102 | Ryan 76213908 
103 Stephanie 98132452 


Second Normal Form (2NF) 
1yv The table must be in first normal form. 
2. All non-prime attributes are fully functionally dependent on the primary key. (It must not 


contain any partial dependency.) 
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EmployeeProjectDetail 


Employee Code | Project ID | Employee Name | Project Name 
101 PO3 John Project103 
101 PO1 John Project101 
102 P04 Ryan Project104 
103 PO2 Stephanie Project102 


In the above table, the primary key is {Employee Code + Project ID}. 


We have partial dependencies in this table because 
Employee Code > Employee Name 


Project ID > Project Name can be determined by Project ID. 


Thus, the above relation violates the rule of 2NF. 


To remove partial dependencies from this table and.normalize it into second normal form, we can 


decompose the EmployeeProjectDetail table intojthe following three tables: 


EmployeeDetail 


Employee Code | Employee Name 
101 John 
101 John 
102 Ryan 
103 Stephanie 


EmployeeProject 


Employee Code | Project ID 
101 P03 
101 PO1 
102 P04 
103 PO2 
ProjectDetail 
Project ID Project Name 
PO3 Project103 
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Project ID Project Name 
PO1 Project101 
P04 Project104 
PO2 Project102 


Third Normal Form (3NF) 
For a relational table to be in third normal form, it must satisfy the following rules: 
1. The table must be in the second normal form. 
2. All non-prime attribute depends on only primary key. (No non-prime attribute dependents on 
any other non-prime attributes.) 


EmployeeDetail 


Employee Code | Employee Name | Employee Zipcode | Employee City. 
101 John 110033 Model Town 
101 John 110044 Badarpur 
102 Ryan 110028 Naraina 
103 Stephanie 110064 Hari Nagar 


The above table has following functional dependency: 
Employee Zipcode -> Employee City 
Here, Employee Zipcode and Employee Gity both are non-prime attributes. And this is the violation of 


3 NF rule. Therefore the above table isnot in 3NF. 


To normalize it into the third normal form, we can decompose the <EmployeeDetail> table into the 
following two tables: 


EmployeeDetail 


Employee Code | Employee Name | Employee Zipcode 
101 John 110033 
101 John 110044 
102 Ryan 110028 
103 Stephanie 110064 


EmployeeLocation 


Employee Zipcode | Employee City 
110033 Model Town 
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Employee Zipcode | Employee City 
110044 Badarpur 
110028 Naraina 
110064 Hari Nagar 


24. What is Decomposition of database? Explain the rules for decomposition. 


Decomposition is a process of breaking (decomposing) a relation into multiple relationsito bring it into 
an appropriate normal form. It helps to remove redundancy, inconsistencies, and anomalies from a 
database. 
Rules for Decomposition 

1. Lossless Join Decomposition. 


2. Dependency Preserving. 


1. Lossless Join Decomposition 


A lossless Join decomposition ensures two things: 
1. No information is lost while decomposing from the original relation. 


2. If we join back the sub decomposed relations, the same relation that was decomposed is obtained. 


Let’s see an example of a lossless join decomposition. Suppose we have the following relation 
EmployeeProjectDetail as: 


EmployeeProjectDetail 


Employee_Gode | Employee_Name Employee_Email Project_Name | Project_ID 
101 John john@demo.com Project103 P03 
101 John john@demo.com Project101 PO1 
102 Ryan ryan@example.com Project102 PO2 
103 Stephanie stephanie@abc.com Project102 PO2 


Now, we decompose this relation into EmployeeProject and ProjectDetail relations as: 


EmployeeProject 


Employee_Code|Project_ID/Employee_Name| Employee_Email 


101 PO3 John john@demo.com 
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Employee_Code|Project_ID/Employee_Name| Employee_Email 
101 PO1 John john@demo.com 
102 P04 Ryan ryan@example.com 
103 PO2 Stephanie stephanie@abc.com 


The primary key of the above relation is {Employee_Code, Project_ID}. 


ProjectDetail 
Project_ID Project_Name 
PO3 Project103 
P01 Project101 
P04 Project104 
PO2 Project102 


The primary key of the above relation is {Project_ID}. 
Dependency Preserving 


It says that after decomposing a relation R into R1 and R2, all dependencies of the original relation 
R must be present either in R1 or R2 


Let’s understand this from the same example,above: 


Table EmployeeProjectDetail has following functional dependencies: 
Employee_Code -> {Employee_Name, Employee_Email} 


Project_ID - > Project_Name 


Now, after decomposing the relation into EmployeeProject and ProjectDetail, it have following 


functional dependencies: 


EmployeeProject has: Employee_Code -> {Employee_Name, Employee_Email} 
And 


ProjectDetail has: Project_ID - > Project_Name 


As we can see that all FDs in EmployeeProjectDetail are either part of the EmployeeProject or the 


ProjectDetail, So this decomposition is dependency preserving. 
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25. Identify the normal form of the given table and convert it into 3"? normal form. 


CourseCode | Course_Name Classroom | InstructorName | ContactNo City PINCode 
CS101 RDBMS A101 Rajesh Gupta 9876543210 | Mumbai | 400001 
9876543220 
ES201 ES B202 Priya Sharma 9831045678 | Delhi 110001 
MTH301 Maths C303 Amit Kumar 9876543201 | Bangalore} 560001 
PHY401 Quantum D404 Rajesh Gupta 9876543210 | Mumbai | 400001 
Mechanics 9876543220 
CHEM501 Organic Chemistry | E505 Anjali Singh 9832476512 | Lucknow | 226001 
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